10 research outputs found

    Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Microarray technologies produced large amount of data. In a previous study, we have shown the interest of <it>k-Nearest Neighbour </it>approach for restoring the missing gene expression values, and its positive impact of the gene clustering by hierarchical algorithm. Since, numerous replacement methods have been proposed to impute missing values (MVs) for microarray data. In this study, we have evaluated twelve different usable methods, and their influence on the quality of gene clustering. Interestingly we have used several datasets, both kinetic and non kinetic experiments from yeast and human.</p> <p>Results</p> <p>We underline the excellent efficiency of approaches proposed and implemented by Bo and co-workers and especially one based on expected maximization (<it>EM_array</it>). These improvements have been observed also on the imputation of extreme values, the most difficult predictable values. We showed that the imputed MVs have still important effects on the stability of the gene clusters. The improvement on the clustering obtained by hierarchical clustering remains limited and, not sufficient to restore completely the correct gene associations. However, a common tendency can be found between the quality of the imputation method and the gene cluster stability. Even if the comparison between clustering algorithms is a complex task, we observed that <it>k-means </it>approach is more efficient to conserve gene associations.</p> <p>Conclusions</p> <p>More than 6.000.000 independent simulations have assessed the quality of 12 imputation methods on five very different biological datasets. Important improvements have so been done since our last study. The <it>EM_array </it>approach constitutes one efficient method for restoring the missing expression gene values, with a lower estimation error level. Nonetheless, the presence of MVs even at a low rate is a major factor of gene cluster instability. Our study highlights the need for a systematic assessment of imputation methods and so of dedicated benchmarks. A noticeable point is the specific influence of some biological dataset.</p

    Etude de la réponse de Saccharomyces cerevisiae à une perturbation NADPH par une approche de biologie des systèmes

    Get PDF
    L'élucidation des propriétés du réseau métabolique est fondamentale pour la compréhension du fonctionnement cellulaire et pour l'élaboration de stratégies d'ingénierie métabolique. L'objectif de cette thèse était de mieux comprendre la régulation du métabolisme du NADPH, un métabolite "hub" qui joue un rôle central dans de nombreux processus cellulaires, chez Saccharomyces cerevisiae en fermentation. Nous avons utilisé une démarche systématique couplant modélisation et approches multi- omics pour étudier de façon quantitative la réponse à une perturbation de la demande en NADPH. Un système expérimental original, basé sur l'expression d'une butanediol déshydrogénase modifiée NADPH-dépendante a été utilisé pour augmenter de façon contrôlée la demande en NADPH. L'utilisation de ce dispositif, le développement et l'utilisation d'un modèle stœchiométrique de la levure dédié à la fermentation ont permis de prédire la répartition des flux pour différents niveaux de perturbation. Ces analyses ont montré, en premier lieu, la très grande capacité de la levure à faire face à des demandes très importantes de NADPH représentant jusqu'à 40 fois la demande anabolique. Pour des demandes modérées (allant jusqu'à 20 fois la demande anabolique), la perturbation est principalement compensée par une augmentation du flux à travers la voie des pentoses phosphate (VPP) et à moindre titre à travers la voie acétate (Ald6p). Pour une forte demande en NADPH, correspondant à 40 fois la demande anabolique, le modèle prédit la saturation de la VPP ainsi que la mise en place du cycle glycérol-DHA, qui permet l'échange du NADH en NADPH. Des analyses fluxomique (13C), métabolomique et transcriptomique, ont permis de valider ces hypothèses et de les compléter. Nous avons mis en évidence différents niveaux de régulation selon l'intensité de la perturbation : pour les demandes modérées, les flux sont réajustés par un contrôle au niveau enzymatique ; pour de fortes demandes, un contrôle transcriptionnel de plusieurs gènes de la VPP ainsi que de certains gènes des voies de biosynthèse des acides aminés est observé, cet effet résultant probablement de la moindre disponibilité en NADPH. Dans l'ensemble, ce travail a apporté un nouvel éclairage sur les mécanismes impliqués dans l'homéostasie du NADPH et plus généralement dans l'équilibre redox intracellulaire.The elucidation of the properties of metabolic network is essential to increase our understanding of cellular function and to design metabolic engineering strategies. The objective of this thesis was to better understand the regulation of the metabolism of NADPH, a hub metabolite which plays a central role in many cellular processes in Saccharomyces cerevisiae during fermentation. We used a systematic approach combining modeling and multi- omics analyses to study quantitatively the response to a perturbation of the NADPH demand. An original experimental system, based on the expression of a modified NADPH-dependent butanediol dehydrogenase was used to increase the demand for NADPH in a controlled manner. Through the use of this device and the development and use of a stoichiometric model of yeast dedicated to the fermentation, we predicted the flux distribution for different levels of perturbation. These experiments showed, first, the overwhelming ability of yeast to cope with very high NADPH demand, up to 40 times the anabolic demand. For a moderate level (up to 20 times the anabolic demand), the perturbation is mainly compensated by increased flux through the pentose phosphate pathway (PPP) and to a lesser extent through the acetate pathway (Ald6p). For a high NADPH demand, corresponding to 40 times the anabolic demand, the model predicts the saturation of the PPP as well as the operation of the glycerol-DHA cycle, which allows the exchange of NADH to NADPH. Fluxomics (13C), metabolomics and transcriptomics data were used to validate and to complement these hypotheses. We showed different levels of control depending on the intensity of the perturbation: for moderate demands, flux remodeling is mainly achieved by enzymatic control; for a high demand, a transcriptional control is observed for several genes of the PPP as well as some genes of the amino acids biosynthetic pathways, this latter effect being likely due to the low NADPH availability. Overall, this work has shed new light on the mechanisms governing NADPH homeostasis and more generally the intracellular redox balance.MONTPELLIER-SupAgro La Gaillarde (341722306) / SudocSudocFranceF

    Study of the response to NADPH perturbation by a systems biology approach in Saccharomyces cerevisiae

    No full text
    L'élucidation des propriétés du réseau métabolique est fondamentale pour la compréhension du fonctionnement cellulaire et pour l'élaboration de stratégies d'ingénierie métabolique. L'objectif de cette thèse était de mieux comprendre la régulation du métabolisme du NADPH, un métabolite "hub" qui joue un rôle central dans de nombreux processus cellulaires, chez Saccharomyces cerevisiae en fermentation. Nous avons utilisé une démarche systématique couplant modélisation et approches multi-“omics” pour étudier de façon quantitative la réponse à une perturbation de la demande en NADPH. Un système expérimental original, basé sur l'expression d'une butanediol déshydrogénase modifiée NADPH-dépendante a été utilisé pour augmenter de façon contrôlée la demande en NADPH. L'utilisation de ce dispositif, le développement et l'utilisation d'un modèle stœchiométrique de la levure dédié à la fermentation ont permis de prédire la répartition des flux pour différents niveaux de perturbation. Ces analyses ont montré, en premier lieu, la très grande capacité de la levure à faire face à des demandes très importantes de NADPH représentant jusqu'à 40 fois la demande anabolique. Pour des demandes modérées (allant jusqu'à 20 fois la demande anabolique), la perturbation est principalement compensée par une augmentation du flux à travers la voie des pentoses phosphate (VPP) et à moindre titre à travers la voie acétate (Ald6p). Pour une forte demande en NADPH, correspondant à 40 fois la demande anabolique, le modèle prédit la saturation de la VPP ainsi que la mise en place du cycle glycérol-DHA, qui permet l'échange du NADH en NADPH. Des analyses fluxomique (13C), métabolomique et transcriptomique, ont permis de valider ces hypothèses et de les compléter. Nous avons mis en évidence différents niveaux de régulation selon l'intensité de la perturbation : pour les demandes modérées, les flux sont réajustés par un contrôle au niveau enzymatique ; pour de fortes demandes, un contrôle transcriptionnel de plusieurs gènes de la VPP ainsi que de certains gènes des voies de biosynthèse des acides aminés est observé, cet effet résultant probablement de la moindre disponibilité en NADPH. Dans l'ensemble, ce travail a apporté un nouvel éclairage sur les mécanismes impliqués dans l'homéostasie du NADPH et plus généralement dans l'équilibre redox intracellulaire.The elucidation of the properties of metabolic network is essential to increase our understanding of cellular function and to design metabolic engineering strategies. The objective of this thesis was to better understand the regulation of the metabolism of NADPH, a “hub” metabolite which plays a central role in many cellular processes in Saccharomyces cerevisiae during fermentation. We used a systematic approach combining modeling and multi-“omics” analyses to study quantitatively the response to a perturbation of the NADPH demand. An original experimental system, based on the expression of a modified NADPH-dependent butanediol dehydrogenase was used to increase the demand for NADPH in a controlled manner. Through the use of this device and the development and use of a stoichiometric model of yeast dedicated to the fermentation, we predicted the flux distribution for different levels of perturbation. These experiments showed, first, the overwhelming ability of yeast to cope with very high NADPH demand, up to 40 times the anabolic demand. For a moderate level (up to 20 times the anabolic demand), the perturbation is mainly compensated by increased flux through the pentose phosphate pathway (PPP) and to a lesser extent through the acetate pathway (Ald6p). For a high NADPH demand, corresponding to 40 times the anabolic demand, the model predicts the saturation of the PPP as well as the operation of the glycerol-DHA cycle, which allows the exchange of NADH to NADPH. Fluxomics (13C), metabolomics and transcriptomics data were used to validate and to complement these hypotheses. We showed different levels of control depending on the intensity of the perturbation: for moderate demands, flux remodeling is mainly achieved by enzymatic control; for a high demand, a transcriptional control is observed for several genes of the PPP as well as some genes of the amino acids biosynthetic pathways, this latter effect being likely due to the low NADPH availability. Overall, this work has shed new light on the mechanisms governing NADPH homeostasis and more generally the intracellular redox balance

    A constraint-based model analysis of the metabolic consequences of increased NADPH oxidation in Saccharomyces cerevisiae.

    No full text
    International audienceControlling the amounts of redox cofactors to manipulate metabolic fluxes is emerging as a useful approach to optimizing byproduct yields in yeast biotechnological processes. Redox cofactors are extensively interconnected metabolites, so predicting metabolite patterns is challenging and requires in-depth knowledge of how the metabolic network responds to a redox perturbation. Our aim was to analyze comprehensively the metabolic consequences of increased cytosolic NADPH oxidation during yeast fermentation. Using a genetic device based on the overexpression of a modified 2,3-butanediol dehydrogenase catalyzing the NADPH-dependent reduction of acetoin into 2,3-butanediol, we increased the NADPH demand to between 8 and 40-fold the anabolic demand. We developed (i) a dedicated constraint-based model of yeast fermentation and (ii) a constraint-based modeling method based on the dynamical analysis of mass distribution to quantify the in vivo contribution of pathways producing NADPH to the maintenance of redox homeostasis. We report that yeast responds to NADPH oxidation through a gradual increase in the flux through the PP and acetate pathways, providing 80% and 20% of the NADPH demand, respectively. However, for the highest NADPH demand, the model reveals a saturation of the PP pathway and predicts an exchange between NADH and NADPH in the cytosol that may be mediated by the glycerol-DHA futile cycle. We also reveal the contribution of mitochondrial shuttles, resulting in a net production of NADH in the cytosol, to fine-tune the NADH/NAD(+) balance. This systems level study helps elucidate the physiological adaptation of yeast to NADPH perturbation. Our findings emphasize the robustness of yeast to alterations in NADPH metabolism and highlight the role of the glycerol-DHA cycle as a redox valve, providing additional NADPH from NADH under conditions of very high demand

    Harnessing virtual machines to simplify next-generation DNA sequencing analysis

    No full text
    Motivation: The growth of next-generation sequencing (NGS) has not only dramatically accelerated the pace of research in the field of genomics, but it has also opened the door to personalized medicine and diagnostics. The resulting flood of data has led to the rapid development of large numbers of bioinformatic tools for data analysis, creating a challenging situation for researchers when choosing and configuring a variety of software for their analysis, and for other researchers trying to replicate their analysis. As NGS technology continues to expand from the research environment into clinical laboratories, the challenges associated with data analysis have the potential to slow the adoption of this technology. Results: Here we discuss the potential of virtual machines (VMs) to be used as a method for sharing entire installations of NGS software (bioinformatic 'pipelines'). VMs are created by programs designed to allow multiple operating systems to co-exist on a single physical machine, and they can be made following the object-oriented paradigm of encapsulating data and methods together. This allows NGS data to be distributed within a VM, along with the pre-configured software for its analysis. Although VMs have historically suffered from poor performance relative to native operating systems, we present benchmarking results demonstrating that this reduced performance can now be minimized. We further discuss the many potential benefits of VMs as a solution for NGS analysis and describe several published examples. Lastly, we consider the benefits of VMs in facilitating the introduction of NGS technology into the clinical environment

    A comparative transcriptomic, fluxomic and metabolomic analysis of the response of <it>Saccharomyces cerevisiae</it> to increases in NADPH oxidation

    No full text
    <p>Abstract</p> <p>Background</p> <p>Redox homeostasis is essential to sustain metabolism and growth. We recently reported that yeast cells meet a gradual increase in imposed NADPH demand by progressively increasing flux through the pentose phosphate (PP) and acetate pathways and by exchanging NADH for NADPH in the cytosol, via a transhydrogenase-like cycle. Here, we studied the mechanisms underlying this metabolic response, through a combination of gene expression profiling and analyses of extracellular and intracellular metabolites and <sup>13</sup> C-flux analysis.</p> <p>Results</p> <p>NADPH oxidation was increased by reducing acetoin to 2,3-butanediol in a strain overexpressing an engineered NADPH-dependent butanediol dehydrogenase cultured in the presence of acetoin. An increase in NADPH demand to 22 times the anabolic requirement for NADPH was accompanied by the intracellular accumulation of PP pathway metabolites consistent with an increase in flux through this pathway. Increases in NADPH demand were accompanied by the successive induction of several genes of the PP pathway. NADPH-consuming pathways, such as amino-acid biosynthesis, were upregulated as an indirect effect of the decrease in NADPH availability. Metabolomic analysis showed that the most extreme modification of NADPH demand resulted in an energetic problem. Our results also highlight the influence of redox status on aroma production.</p> <p>Conclusions</p> <p>Combined <sup>13</sup> C-flux, intracellular metabolite levels and microarrays analyses revealed that NADPH homeostasis, in response to a progressive increase in NADPH demand, was achieved by the regulation, at several levels, of the PP pathway. This pathway is principally under metabolic control, but regulation of the transcription of PP pathway genes can exert a stronger effect, by redirecting larger amounts of carbon to this pathway to satisfy the demand for NADPH. No coordinated response of genes involved in NADPH metabolism was observed, suggesting that yeast has no system for sensing NADPH/NADP<sup>+</sup> ratio. Instead, the induction of NADPH-consuming amino-acid pathways in conditions of NADPH limitation may indirectly trigger the transcription of a set of PP pathway genes.</p

    Epigenetic regulation of GATA2 and its impact on normal karyotype acute myeloid leukemia

    No full text
    The GATA2 gene encodes a zinc-finger transcription factor that acts as a master regulator of normal hematopoiesis. Mutations in GATA2 have been implicated in the development of myelodysplastic syndrome and acute myeloid leukemia (AML). Using RNA sequencing we now report that GATA2 is either mutated with a functional consequence, or expressed at low levels in the majority of normal karyotype AML (NK-AML). We also show that low-GATA2-expressing specimens (GATA2(low)) exhibit allele-specific expression (ASE) (skewing) in more than half of AML patients examined. We demonstrate that the hypermethylation of the silenced allele can be reversed by exposure to demethylating agents, which also restores biallelic expression of GATA2. We show that GATA2(low) AML lack the prototypical R882 mutation in DNMT3A frequently observed in NK-AML patients and that The Cancer Genome Atlas AML specimens with DNMT3A R882 mutations are characterized by CpG hypomethylation of GATA2. Finally, we validate that several known missense single-nucleotide polymorphisms in GATA2 are actually loss-of-function variants, which, when combined with ASE, represent the equivalent of homozygous GATA2 mutations. From a broader perspective, this work suggests for the first time that determinants of ASE likely have a key role in human leukemia
    corecore